Yeah, still doesn't feel natural. I I wish, you know, the the the the room would just detect when an when a meeting's started it starting. Voila cool. So you better start. It was your idea. So. Well maybe uh is is the white-board working? Or or not. Um y maybe show us the general sort of outline. You keep w okay, you can just write it on the paper. Yeah. So is this different from the original P_L_P_ two stuff then, is that? Uh 'cause that's what you started with, wasn't it? This this stuff that Marius Okay. Okay. Yeah, yeah. H he's parameterizing it both temporally and across the spectrum, yep. Yeah. Yeah. Yep, okay. Yep. Yeah, to that. Yep, yep, yep. Mm yeah, but it's a it's a it's a different linear prediction model, isn't it. It's a complex linear prediction or whatever. Yeah. Yeah. Yeah. Yeah. Yep. Okay. Okay. You need the phase. Ooh uh Well Well it depends on how this i I mean you're talking b yeah, just the carrier. Okay, so it's not necessarily phase, but if you're using some sort of sinusoidal coding, it's phase, yeah. Okay. Okay. W What so what sort of bit rate do you get for the um encoding of the this the Hilbert transform? The s the spectrogram. W what sort of coding do you get so far for the no no no no, f f yeah. Yeah. Yeah, yep. Uh wo well, it's no no no, but what sort of bit rate do you get? Is it is it uh id do you have any sort of m idea of what sort of band-widths? Yeah. 'Cause Okay, and that's in that's with the fully optimised sort of compression scheme or that's just the raw data or? Okay, yeah. Yeah. Sure, sure. You just cut out m bands and just use interpolation or whatever. Yeah. T so that's one point okay, that's with the full spectrum basically being used then. Yeah. Yep, yep. Yep. Yep. Yeah, yep. Okay, that's quite good. So and Hynek was showing me how scalable it is in terms of um you can take half the bands away and still get decent performance. 
You take you take three quarters of them away, it starts to degrade, but it's still intelligible, that sort of thing. Yep. Okay. And that's without doing any sort of um prediction of what the missing bands might be, right? You just leave them out. I mean you you shou you could be able to do some sort of code missing data sort of Bayes stuff, yeah. Okay. Okay. Yeah. It it would be it would be nice to have a scalable scheme though, wouldn't it? Like because you've got a nice scalable scheme for the for for the the coding of the the smooth spectrum. It'd be nice to have some sort of scalable coding scheme for the for the voice signal as well, right? Well, I don't know, I'm just Well, at different bit rates. So the actual uh excitation as well. Because I mean it'd be nice uh l let's say if you had twelve hundred bits for the for the sp smooth spectrum and twelve hundred bits for the voice, and then you could scale them both. Yeah. Sure, sure, sure. Yeah. Uh No no. Just long as it's some sort of non-linear. Yep. M The place more emphasis on the Yeah, yeah, yeah. Okay. Yeah. Okay. Hmm. And then vector quantize or whatever. Yeah. Hmm. Hmm. Mm-hmm. So times fifteen. Twen that's t it's twenty times fifteen, it's not Okay. So it's it's yeah, it's three hundred. It I mean it's it's total of three hundred sort of parameters per second. Um Well, y you're not you're not carrying transmitting that at all really at this point in time. I uh And and you get whispering type synthesis, yeah. Yep. Or one of these glottal pulse things. F and just use basic L_P_C_ ten style. Yeah, which is which is probably, I mean, if you're going to aim for like a very low rate code to below five hundred bits per second, I guess. That's that's probably fine. But Uh the lowest standard is MELP I think, which is twenty four hundred, the standard. It's been implemented at twelve hundred f bits per second using more more complicated quantization schemes and stuff like that, I think. I think so, yeah. 
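[Editor's sketch] The arithmetic in the exchange above (twenty parameters for each of fifteen bands, giving three hundred parameters per second) can be checked in a few lines. The pairing with "twelve hundred bits" follows the speaker's own hypothetical; the function name and the assumption of one-second segments are illustrative only and are not taken from the project's code.

```python
def parameter_budget(bands, params_per_band, bits_per_second):
    """Return (parameters per second, average bits available per parameter).

    Assumes each band contributes `params_per_band` parameters for every
    one-second segment, as discussed in the meeting.
    """
    params_per_second = bands * params_per_band
    return params_per_second, bits_per_second / params_per_second


# Fifteen bands, twenty parameters each, twelve hundred bits per second.
params, bits_each = parameter_budget(bands=15, params_per_band=20,
                                     bits_per_second=1200)
print(params, bits_each)
```

Under these assumptions the envelope coder would have on average four bits per parameter, which is why the quantization scheme matters so much for the lower target rates.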
I s I saw that they used matrix quantisation on the L_P_ uh on the L_S_F_ parameters or whatever. But I mean that's quite it becomes quite computationally intensive. But I mean I imagine the same sort of approach could be taken with yours as well. No. Yeah, that's the same. Twenty MELP's a lot better quality though. Yeah. No no. I think I think I think I think CELP or CELP or however you pronounce that. It's about forty eight hundred I think. Yeah. Well the standard's forty eight hundred or something, yeah. Yeah. Or based around that. Yeah. Okay. It's not terribly interesting though, really. Yeah. So I'm Well you you've well, I mean you've got a bunch of options, haven't you. You can use like uh a code-excited or you could use a sinusoidal based. No. Yeah, yeah. Sure, sure, sure. Yeah, but what's more interesting? I I mean 'cause c I I I suspect with something like um CELP. You're gonna get h lot higher band-width, aren't you. I mean it's a high band-width standard. So yeah. Okay. Yeah, yeah, yeah. Okay. Hmm. Okay. So I mean How Yep. Yeah. 'Cause it's a bit tricky, isn't it. That's it's not like a standard I'm just thinking, I mean what sort of opt it's not like you can't just do standard um, you know, inverse filtering for um signal reconstruction or anything like that, can you? Uh oh okay and then you just then you just do the inverse F_F_T_ or whatever. Okay, the oth I mean the other o Mm yeah. But that's gonna be susceptible to having phase problems, is that, when you Yeah, yeah. Yeah. Okay. Yeah. Hmm. And now I mean there is there are um mm there is research uh um pu that's been published showing, you know, speech reconstruction from the essentially the spectrogram without phase information. Yeah. So Yes. Yes, but I mean you can reconstruct the the f the th the entire spectrogram from that, can't you. So you could do it that way. 'Cause I'm just saying there's a heaps of there heaps of options. 
Like if y you be exactly the same as sinusoidal coding or something like that. All you need is the reconstruction of the spectrogram in order to estimate the harmonic amplitudes and Yeah. Yeah. Ye Just by refitting. Yeah. Yeah, yeah. Okay. Yeah, okay. D it's th Yeah. But that's that's the big thing at the moment, isn't it. If if you're mm if i if the if the exci if the if the encoding rate's two kilo-bits per second for the the excitation component of the signal. It doesn't matter how efficient your um encoding of the spectrogram is, you're always gonna have a high relatively high bit rate. Yeah. Nah. Yeah, yeah. Yeah, sure. Well, one could argue it's it's a field that's quite a sort of a niche these days, isn't it. Um In standards and yeah. Mm yeah. Well did you say uh who said Yeah. Is it Nokia's being sued in the U_S_ for their tri-band usage 'cause, you know, intellectual property. Uh no no, g G_S_M_ because they don't have G_S_M_ in the U_S_. Or they didn't have G_S_M_ in the U_S_. Uh it's f uh okay. No no n Uh well that's tri-band, s yeah. Yeah, yeah. Yeah. Yeah. Yeah yep. And apparently Nokia's in a bit of trouble over that one, but yeah. Well it's interesting. Yeah. Should've been a lawyer. Well that's But I mean coding is a much more sort of it's it's it well I mean it's it's uh it when it boils down to just mathematics to a large extent, doesn't it. It's it's just uh sort of it's, you know, communications s sort of No. Yeah. Sure, sure. Yeah. Sure, sure. Push to talk and stuff like that. Yep. And I mean, as you say, I mean that delay is, I mean you can squash that delay to some extent, right. I n I think as far as the co you know, c c delay-less coding goes, I don't think you can probably do much more than what's already been achieved. Because to get higher compression you have to, you know, take advantage of redundancy over longer time spans. So Yeah, sure, but I mean that's transmission delay, not algorithmic delay. 
If you add those together you end up with a the nightmare scenario, I guess. We might have someone else joining our uh meeting soon. Oh yeah. I don't know, I mean uh you'd be you tried sinusoidal coding based approaches, didn't you? Sinusoidal coding. Yeah, but sure. Yeah, m yeah, tr yeah it's a yeah, but I mean the th the the thing is you'd need ti it part of the parameter set for that is uh the um uh the f uh the harmonic amplitudes. And then one way of compressing the harmonic amplitudes is just storing the smooth spectrum. And then you can uh and then you can interpolate yeah, you can just pick the the um the harmonic amplitudes on that spectrum when you reconstruct on that smooth version of the spectrum when you reconstruct. Y yeah. Well, I was just saying Yeah. Yep. Yeah, well I I th it's high. It's quite a high bit rate. But I mean once we it maybe it's something that's scalable. Well it I mean you can use as many harmonics as you want. I m h uh he was just using the fundamental frequency. And if you smooth it doing a smooth spectrum representation of the harmonic amplitudes, then you only need to store the the the fundamental frequency and and the and the maximum voice frequency. And everything else can be yeah, s yeah, sure. So, and then but then the tricky the only tricky thing that we hadn't figured out was the phase information. Um but y but uh you can still get a like you get th um basically if you do phase unwrapping and all of these other tricks, you end up with some sort of phase representation, which is some sort of monotonically decreasing function. Yeah, unwrapped, yeah. So I mean it's something that you could conceivably represent in some sort of low sort of r you know, low bit rate representation, I guess. And then maybe you'd need some sort of correction to um to correct minor phase discontinuities at some point in time. 'Cau Yeah, yeah, sure. Yeah, okay. Okay. Yeah. Yeah. Okay. Yeah. Sure, sure. Yeah. Yep. Yeah, yeah. Hmm. 
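[Editor's sketch] The phase unwrapping mentioned above can be illustrated with a short routine. This is a generic textbook unwrap, not the code the speakers used; the test ramp with a slope of -0.4 radians per sample is an arbitrary assumption chosen so that the unwrapped phase is monotonically decreasing, as described in the discussion.

```python
import math


def unwrap(phases):
    """Remove 2*pi jumps so that successive phase values differ by at most pi."""
    out = [phases[0]]
    for p in phases[1:]:
        step = p - out[-1]
        # Shift the step by a whole number of turns so it is as small as possible.
        step -= 2.0 * math.pi * round(step / (2.0 * math.pi))
        out.append(out[-1] + step)
    return out


# A decreasing linear phase, wrapped into (-pi, pi] and then recovered.
true_phase = [-0.4 * t for t in range(50)]
wrapped = [math.atan2(math.sin(p), math.cos(p)) for p in true_phase]
recovered = unwrap(wrapped)
```

Once the phase is a smooth, near-monotonic curve like this, it becomes plausible to describe it with a handful of parameters plus occasional corrections for discontinuities, which is the low bit rate representation the speaker has in mind.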
No, and you uh you wanna avoid hard choices on voicing decision, yeah. Well that's one of the big problems with L_P_C_ ten for instance. I it stinks because this hard yeah. Yeah, sure. Yeah. Yeah, yep. But but that that two hundred this is what they use for like um st they they do use this for um for um pitch marking in speech. So this is what Alan Black's code uses. But it's very speaker dependent wh on what sort of band width you'd look at this Hilbert envelope. Depending on the pitch, depending on the person, y you'll get a different amount of smoothing and a certain sort of so I mean, you know, if it's a female speaker for instance you yeah. So it's sorta tricky. Yeah. Uh n no. Y yeah, but from memory that that basic script that they had wasn't wasn't very um effective. I mean maybe if you did some sort of initial first p guess and then and then chose your smoothing factor based on that. 'Cause it it you know, selecting at any particular point of what the what the pitch is at the may be inaccurate. But if you estimated it over a second, you should be able to get a decent estimate, I'd I'd think. Yeah. Well it'd be nice to it would be nice to really see what this what you're a actually trying to reconstruct here. This carrier signal, what what it really looks l yeah. And you did say it looks like a frequency modulated cosine. Hmm. Yeah. It would have to be 'cause pitch shifts yeah. Yeah. Yeah. Hmm. Hmm. Mm. Yeah. Mm-hmm. Hmm. Yeah, it's like a yeah. And it it does start to get a little bit n non-linear towards the end. But normally that's that's when the components are not actually probably gonna be audible anyway in voiced speech, right. So yeah. Yeah. Yeah. Yeah, sure. C C 'cause any sort of error in that in that line is is transmitted, is realised in a net as a phase error, right. Yeah. Mm. Okay. Mm yeah. Tricky, yeah. Yeah, yeah, yeah. Y y y you have s five slices like n across that way, I guess. Yeah. Hmm. 
Yeah, that makes sense 'cause it's a time domain sort of approach. Um and I mean uh and and in in fact I mean it's what you're doing is so different to most approaches to speech coding that it doesn't matter that we don't know much about or haven't had much i it's I mean it's quite different to a lot of the traditional sort uh so uh voi um you know, source filter type approaches or or whatever, yeah. So It's just a matter of minimum distortion time. Yeah. Sure. Unvoiced unvoiced speech is very difficult to b sort of judge on quality compared to voiced, I think. Plosives. Yeah, plosives are very important though, I guess. So maybe it does a better job of plosives. Okay. It'd be nice if we had some demos here. Well, I suppose have to wait to your I guess. Oh it's a little wa oh bugger. I won't be, yeah. No. You'll practice it before then, won't you? Yeah, he's going to Oz. Yep. I did, yeah. But we're d we're doing a more traditional s s frequency slices rather than temporal based stuff. Yeah. Well that's w that was one of the approaches, sure. Yep. And the other approach was was this H_M_M_ based stuff, which is which is basically just a parameterization of the, you know, the the the the um the source filter type parameters, and then and then learning the, you know, training like a h speech recognition system. But that's not terribly that's got no relationship to this. I it's yeah. Yeah, sure. Speech coding. I I read some of the other techniques. Um little standards. But um I mean I think I think you're on the right track with the analysis by synthesis but that doesn't tell you how what sort of bit rates you're going to achieve or um and because you have to do it separately for each um band, it would be the I imagine that's the tricky part. Um because it would be varying quite rapidly, wouldn't it, your um your uh code book. Like w within each um when you go along the the the b the one second sort of segment of the band, you Yeah, yeah. Yeah. Well, I mean yeah. 
First you've gotta choose your approach and not worry about the band-width, and then start thinking about how to given 'cause I mean I guess especially 'cause this is a this is a more of a commercial based project, you've g c you've you've gotta i match the quality before you start worrying about other things, right. I mean d d do they I mean what what what sort of uh Uh-huh, okay. Well that makes things uh With They're happy. Okay. Cool. Yeah. Sure, sure. And at the moment you're at sort of a point where maybe you have to make that choice a little bit. Yeah. Mm-hmm. Yeah. Uh which is not Yeah. Yeah. Al although the standards, depending on what standard you're dealing with, sometimes it can be quite ambiguous, right. I mean all the M_PEG standards are basically this is the container, we don't care how you do it, right, but it's gotta fit inside this this sort of this sort of format. Okay. And then you have the v and then you have to do M_O_S_ um testing and Okay. Well, ho it Okay. Mm. Yeah, yeah. Well, I th No, no, no. Uh no, no, no. Yep. Well the the what's what's new is in the transmission medium. Like you know, new like higher band width transmission or uh or like packet packetized based transmission. And then all th that sort of thing. The where th where they're they're not so much looking at the algorithm, but so much as how to incorporate the algorithm within a new transmission sort of s medium. Basically s like I mean if you d if you see a lot of papers on like um on uh voice over I_P_ or whatever, the the publications are sort of like um how do we deal with packet loss, or or how was it affected by packet loss and and and things like that. Yeah, sure. Yeah. And optimization of the code and all that sorta stuff. Yeah. But even that's less of a big deal now, right, because all the little um I_ s yeah. Not really. You could throw Matlab on a mobile phone these days. Well, maybe not, but Yeah. Y yeah. Yeah, which David did, yeah. It's good, yeah. 
Yeah, we've got all his code implemented. So uh It wasn't perfect. There was little bit of bugs with some of the matching. But it was pretty hard to tell that it was m not not perfect unless you heard the samples sort of next to one another. So that's not too bad. Oh, sure.. And I mean it Well th The only trick is the phase. And we've got a student starting in January who's gonna start working on on how to actually efficiently parameterize the um the H_M_M_ yeah, yeah. And I think I think we've got a number of ideas for the the spectrum. That's easily enough handled just using standard techniques. But the Without phase? I've got no idea. Uh you reviewed a paper on a Uh Steven Cox, isn't it? Yeah. University of East Anglia. It's near c sort of it's it's near the sorta Cambridge side of England, I guess. No the much more mathematical, I think. Yeah, just basically doing a search through or or stuff stuff like that, don't they? Or Yeah. Yeah. No. Yeah, yeah. Sure. Where you've only got the M_F_C_C_s. But th I think I think the I think it was to was it to do with Aurora or something? The Okay. I was Okay. Yeah, but it w it was for say um say you've got someone who has you know, they wanna dispute that this is what they actually said by the recognition. And the recognition network was wrong and you owe me money or whatever. Um and then they can say well we've we've reconstructed your speech from the speech recogni input to the speech recognizer. Hey voila it sounds like you uh Uh I mean obviously there's a lot more to it than that. There's a lot of sort of, well, does the quality stand up to legal sort of requirements or whatever. But um Oh no, that there's a separate set of standards I think for speech recognition. Y yeah, yeah, yeah. Yep. Which uh I think requires, you know, that states the un the information they can send is this. So I don't know whether you tack on any extra phase information or whatever. You have to deal with what you're given. So Yeah. 
I don't know. I I it would be interesting. You I mean maybe that's the best approach to look at to begin with is just pretend all you have got is Yeah. Um because you can also interpolate the pitch information from the the spectrum as well. They do that. Yeah. So I mean maybe try to see how good you can go without transmitting any of that information. The pitch was really good, I thought. I th didn't think you could really tell. Yeah. Yeah. Yeah, but but the thing is if it's Mel scaled, if you have a Mel scale, you get quite fine reconstruction of the spectrum at the lower frequencies it which is what you need for reconstruction of the pitch anyway. So I mean if you wanted to transmit with fewer bands, you'd maybe still keep the lower bands or or maybe you could still transmit the pitch. The pitch isn't that big a deal. But it'd be nice if you didn't because then I mean that's well, I mean it's worth giving it a go and then men maybe seeing wh what information you can add to the data stream to improve performance. Um I don't know. That that'd be cool. So get coding. Mm I think w Ah, well I mean Huffman's easy enough. But but yeah, sure. Uh that that's where I stop. Run length encoding, I'm out of here. Okay. I mean I I th I think they do try to I mean obviously the the the techniques they they choose are supposed to be less, you know, susceptible to coding errors, let's say. You know, i but I mean that's maybe the the limit to that. Yeah. Okay. Okay.
Yeah, at least on paper. Yeah. Yeah, at least you can write on the p Yeah, CELP. Yeah. Hmm. No, but Marius is it's similar stuff. No, Marius is also using like Hilbert transform and then Both. Yeah.. Uh is some kind of D_C_T_ or again yeah. Yeah, ok Yeah. Yeah, yeah. Yeah. How c Mm-hmm. Yeah. Because use linear phase or Yeah uh Oh okay. Yeah. Yeah. Same frequency. Yeah. For L_S_F_. Main bit rate. What is Good. Oh. Ah, okay. Hmm. Hmm. S Yeah. Hmm. Hmm. Yeah, yeah. Hmm. Oh yeah, type. Hmm. S Hmm. S so Oh okay. Smoothed? Or like apply more Hmm. Ah. Hmm. So after you're doing L_S_F_s so or like I mean how you're finally hand coding Ah. Oh. Yeah. So first you apply these triangle filters and then do L_P_C_ coding or like Yeah. Yeah, okay. Yeah. Filters. Yeah. Oh okay. Hmm. You can do the L_P_ analysis on each sub-band, yeah. Yeah, m yeah, ok Yeah. Yeah. Yeah. Yeah. Yeah. So how many L_S_F_s you're using and Oh okay. So but you how many bands you're using? If you're you're not skipping any bands or like oh ok fifteen into twenty then. Like uh. Hmm. Ah. Oh yeah. But phase you're not encoding anything. Phase you're directly transmitting that. Uh no th Yeah, yeah, yeah. The phase of Hilbert envelope. You're not transmitting at all like. Yeah yeah, is noise. Fourier. Yeah. Yeah. So what is the CELP standard now? Like it's it's two kilo-bits per second? Or the Oh that two two kilo-bits per t Ah ok Ah. Some Yeah, yeah. But Yeah. Hmm. Yeah. But this is codec centred, no. You have code book. So Oh it's not codecs code book. There is not code book. Oh ok Oh. Yeah, CELP is Yeah, it's it's it's around one kilo-byte like. Yeah. Yeah, that's a, I think, even G_S_M_ and all this, G_ seven two nine. Yeah, yeah. On based on CELP. At least it gives But still they use a code book, no. So that's really uh it's maybe better than like using just random noise or Yeah. Yeah. Yeah. Yeah. Huh. Yeah. Ah, okay. Hmm. But Hmm.. Inverse, yeah. Inverse Hilbert. Yeah. Yeah. Or output signal like Yeah. 
And then yeah. Yeah yeah. Hmm. Oh ok Hmm. Hmm. Hmm. Hmm. But yeah but it's more towards industry, no. Like the yeah, because I worked in a industry for like a small time. But they're they used to implement all these coders, G_S_M_ and I_T_U_ standards. No, just before my P_H_D_. for like few months. Yeah, but that's uh like industry it's really a big deal like. So they they they work on a lot of these standards. Yeah. But mostly they're all based on CELP like mainly. Yeah. So How? Tri-band is G_S_M_. C_D_M_A_. No no, but C_D_M_ is there but now slowly they're introducing G_S_M_ also now in U_S_ like. Yeah, G_S_M_ tri-band. Like the frequencies are different, like very uh Ah. Yeah. Now it's really big competition. Yeah. Standard maybe like established. Yeah. So where do you think your thing is useful? In C_D_M_A_ or G_S_M_? I think maybe in C_D_M_A_ it may be m C_D_M_A_ is more like More time domain. The they they are c doing only Yeah. Yeah. Yeah. Yeah. But you know really the these long line telephones, they already have delay, no. At least half a second or yeah. Yeah. Yeah. Yeah. But mobile m And this one is Uh harmonics and the Yeah. And then picking yeah. Yeah. But maybe the bit rate is the problem, no. Like what David was doing. Like it's really uh yeah, yeah, yeah. Yeah, maybe you can just see how many how many harmonics. He was using twenty, I think. Like yeah, maybe you can try to reduce. Yeah. Yeah. And the f And then all this amplitude. Ah. Yeah, yeah. Yeah. Yeah. Each again. Some kind of linear yeah. Yeah. Yeah. Some function or like a Hmm. Yeah. Yeah. Oh. Yeah. The size of the But you don't do any voice un-voice decision, no? Yeah, the that's even advantage I think for this approach. Yeah. Yeah. Yeah. sorta the modelling, it's really another issue like. Yeah. Yeah. Yeah, yeah. Yeah. Yeah. And then you have to combine all this harmonic model and noise model. So again you may end up some. 'Cause Oh okay. The pe peak uh Yeah, yeah. Yeah. Yeah. S Yeah. 
Yeah. So that's a you have to tune lot of parameters like I am. But anyway, you don't really need hard decisions, no? Like voiced unvoiced. So m maybe it may help you, but it yeah. Yeah. Yeah yeah. Yeah. But do you really really need it or like voiced unvoiced? Yeah, but yeah. Yeah. But it's not really you need t you don't need to specify at each point it's voiced or yeah. Huh. Yeah. And then amplitude. But you're telling even frequency's also shifting, no? Like your delay. Yeah. Yeah, yeah, yeah. Yeah. So then, yeah, then it's not exactly amplitude modulations or it's it's in between maybe. Yeah. S yeah. So yeah. Yeah.. That's what Marius did, no, like. But but he was keeping all the phase information. No, but even the his coded like he just he he didn't do anything, he just transferred it that way. He was aiming for he's not really kind of doing speech coding, but he want to do it on music. So he doesn't really care about the band-widths or he want to have the quality, no, like. Yeah, but even especially the audio series and the that that's what he was telling like. Yeah, at least even ten K_ bit. Twenty kilo-bits also is okay. So but he's uh he's not I think finished with all that thing. He's just transmitting all the phase and then but he's encoding the envelope, yeah. Yeah. Yeah. Yeah. And then leaving the phase, yeah. Yeah. Yeah. Hmm. Yeah. Yeah. Yeah. Yeah, or Auto-correlation coefficient and Yeah, they again give the phase. Yeah, again give the phase. Yeah. Oh. Hmm. But this phase is different from your carrier, no? Like yeah. Because you're just doing on uh your uh yeah. And also like yeah. And Yeah. Hmm. But even you can use as a can you use that phase directly? Whereas the d yeah, this one is direct But this is phase of your d uh spectrum, no? Like it's not in time domain. So Of your signal. It's not so then it's bit tricky. Like how Okay, but still you get only the the magnitude of your Hilbert envelope. 
You won't get the No, but you started with Hil Hilbert envelope, no? Like magnitude of Hilbert envelope. Magnitude of uh no, before D_C_T_ your d oh. Yeah. Ah, ok so input to the D_C_T_ is the speech or like your yeah, okay. Just you're starting with Ah. Ah. Ok ok ok Yeah, maybe the then Yeah. Ah okay. Yeah, it's in frequency domain, no. Yeah, definitely it's d But now I think people Yeah. Yeah. So when is your turn? Uh Oh. Oh okay. I mean just before. Oh you.. Low bit rate. Yes. Yeah. Hmm. Hmm. Hmm. Oh okay. Yeah. Yeah. Yeah yeah. Yeah, you know, the the Yeah. Hmm. Yeah. Uh. No no no, yeah. Yeah. But it's mostly like this is uh s CELP and these thing. No, I was t uh doing some uh like writing some code for testing and these things where I'm making some t standards for that. So so th th there are like li the thing, these standards, how it works is like they give all the pseudo code kind of thing and then they put severe test conditions. So you have to pass those test conditions. Like then j you can say okay, we implemented D_ seventy eight or D_ se Yeah. Yeah. Yeah. Yeah. Yeah. Yeah. Yeah. So that's why it's really even the t standards they give the test signals also. You are t you can't really do it on whatever database you have. So so you are to use the uh uh yeah. Uh it yeah, it's more like uh they define. So I did for some even also. They're also they define all this the speech condition. Everything they define. So you need to follow all those. Yes, this industry is again different like, yeah. Yeah, I think there isn't only se most yeah. Ah. CELP is from uh A_T_ and T_, no? Like A_T_ and yeah, yeah. It the all this L_P_C_ stuff is Yeah. Ah, ok Ah, okay. Yeah. Yeah. Yeah. The Yeah, that's the main thing even. Yeah. Yeah. Yeah. Yeah, like I. Voice over I_P_ or yeah. Uh yeah. Yeah. Yeah. Yeah. And again, lot of this stuff like whe even when I was working mostly that company was developing for voice over I_P_. So they're to put all these things on again D_S_P_s. 
And also it's mostly like how to how many L_P_C_s are yeah, how many co yeah, it's optimization and then it's even Yeah. Again, now now now even they're also becoming yeah. Even the memories are not really yeah. Yeah. Yeah. Yeah. So even I think uh then you can have whatever band-width is. It's short really. But no, I think uh but still then that's what maybe people are going toward sinusoidal modelling, no. Like I was before when the bandwidth conditions were really stringent then they were using all the p L_P_C_ based source filter. Yeah. Yeah. Yeah, if you want you can look at his code and. But only th yeah, yeah. Yeah. Yeah. But definitely it's better than co coded speech, no? Different. Yeah, it's really but uh how much you can get the compress these like what you're telling is you can do all these tricks like. Yeah, these yeah. In yeah. Well yeah, I read uh I reviewed one paper like yeah, the that's that's from no. Yeah, maybe you can look at dates from uh you know east of uh east uh yeah, East Anglia. Yeah. Uh this uh e east University of East Anglia, Ben Milner. Yeah. I yeah, it's yeah, I can forward you that. They're doing lot of tricks. They're like they're again training Gaussian mixtures for different frequencies and then but they can they're following the yeah. Yeah. Yeah. No no, the those things are again based on L_P_C_, no? Like mainly like yeah. Whereas like the pitch estimation in in the X_ waves or these things, they're all dynamic programming based on again L_P_C_ analysis. Yeah. And then construct that. Huh. Yeah, again like uh no, the problem is different for everybody. Like these guys, what they're doing for distributed speech recognition mainly, this uh so. Uh yeah, M_F_C_C_s. So No, it's E_T_S_A_ or something. European Standard yeah Hynek really n yeah, even it's h kind of Aurora framework. So yeah. No, but now it's bit. 
Yeah, now they're And then yeah, M_F_C_ s But even at distant point they're planning to use for j uh standard D_S_M_ or D_S_ it's not? Or Yeah. E_ uh E_T_S_A_I_, yeah. European, yeah, commission. So M yeah. Yeah. Yeah. Yeah. Yeah, i yeah, all these standards are really okay. Yeah. But Yeah. But the quality again problem, no like? No no. The the eu again, European standard, what this give us, they give the M_F_C_C_s and they give the pitch. Then you are to construct the thing. But if have M_F_C_C_s then like again you have to get the high order M_F_C_C_s and then yeah, then you're have to Yeah. Yeah. Yeah. Hmm. Mm. Yeah. But I think again it's different, no? Like these standards again, all these G_S_M_ standards. Maybe they have the compression standards and again coding stand but maybe uh I I read ah yeah. I don't remember anything like yeah, they're doing yeah, combi yeah, the even even G_ even normal G_ seventy nine, normal speech coder. So they have the speech coding standards again. They are these channel coding. So yeah. So it's mm. Yeah, b Maybe. Yeah. Yeah. Again, yeah, i So okay? Yep. Th thank you then. Oh.
You can write on the paper, yeah. It's it's okay. Right. Right, right. Mm-hmm. Hmm. Okay. I have a question. Is the scale linear or is Mel scale, like in M_F_C_C_? Is it bar scale, like in like in sp yeah, okay. The overlapping is different, right? Yep. Mm. Okay and? Okay. Uh-uh. Mm-hmm. Okay. Mm-hmm. I have never done anything. Yeah, tri-band G_S_M_, yeah. Different frequency. Yeah, exactly. Mm-hmm. Okay. True. Mm. Mm-hmm. I've never done any coding in my life. So I just have one curiosity. So this is coding then s you code the speech, then you're also supposed to transmit the speech, right? Th let's say in G_S_M_ or something else. I remember when I was doing some image processing, there were techniques for compressing and then coding. So adding channel information on the image at the same time in order to save sound bits. You see what I mean? Exactly. I d was just wondering if there's some s someone was doing it with speech as well. Well actually, when you compress yeah, all like first you compress the signal, right? And then you al you add com some complexity to the signal in order to recover the error. So I was wondering if by any chance there was something like this in speech with the speech. You don't know? Okay. Well It's not the most difficult, right? Sure, sure. And and this was the problem actually because you were No not not in G_S_M_, definitely definitely not. Uh Okay. Then th they some, you know, source. So yeah. Okay. Just one. Yeah. Definitely. Is it good? Okay. Okay, thank you.
Oh my God. Yeah. Okay. I should start? Yeah, no so I thought that we might s t speak a little bit about speech coding because nobody is doing here speech coding and I would like to have some maybe feedback because except me. I'm I'm supposed to do some speech coding stuff here with Hynek uh but you know that little bit at least that uh based on some let's say Hilbert transforms using a longer temporal context and deriving some parameters would be tranf transcs transmitted to a decoder and So it's a little bit different than the stuff which people are using now, which is like a L_P_C_ based. And but we are at the beginning with everything more or less. So still um we can't uh d encode a speech. But of course there is there are many problems which we still didn't solve and the biggest one is quite similar to L_P_C_ stuff where people still don't know how to exactly encode a carrier or the source signal. We know what to do with the envelope or how to approximate the envelope, let's say. Um when you have got Oh you mean that it would be nice to plot at least s Um maybe I I can I can So what we are doing now, yeah, you mean. Okay, i uh everybody knows L_P_C_ stuff, how it works. So I don't have to present anything more. But it's uh m more or less very simple. We have let's say at the beginning there is some speech signal. What we are uh we might see it uh in very simple way that we are using some Hilbert transform h at the beginning to create analytic signal. The analytic signal is the signal which is a complex which you can create from the uh real uh uh some sequence. And if you get if you apply Hilbert transform on this, on such a speech signal, like a real trajectory, then you get a complex signal and if you take just the amplitude, then you get Hilbert envelope code. So I well I I don't know. Yeah, let's say ups. Yeah, it's u it's not P_L_P_ at all, d Mm. No he's it's pretty much the same. Uh you're lit I know what you mean with the pu P_L_P_. 
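[Editor's sketch] The construction the speaker describes (real signal, then analytic signal via the Hilbert transform, then amplitude as the Hilbert envelope) can be written out with the standard library alone. This is a minimal illustration using a naive DFT, so it is only suitable for short sequences; the function names are hypothetical and the speakers' actual implementation is not shown in the meeting.

```python
import cmath
import math


def dft(x):
    n = len(x)
    return [sum(x[t] * cmath.exp(-2j * math.pi * k * t / n) for t in range(n))
            for k in range(n)]


def idft(spectrum):
    n = len(spectrum)
    return [sum(spectrum[k] * cmath.exp(2j * math.pi * k * t / n) for k in range(n)) / n
            for t in range(n)]


def analytic_signal(x):
    """Complex signal whose real part is x and whose negative frequencies are zero."""
    n = len(x)
    weights = [0.0] * n
    weights[0] = 1.0                      # keep DC as it is
    if n % 2 == 0:
        weights[n // 2] = 1.0             # keep the Nyquist bin as it is
        positive = range(1, n // 2)
    else:
        positive = range(1, (n + 1) // 2)
    for k in positive:
        weights[k] = 2.0                  # double the positive frequencies
    spectrum = dft(x)
    return idft([spectrum[k] * weights[k] for k in range(n)])


def envelope_and_carrier(x):
    """Split x into its Hilbert envelope and a unit-amplitude carrier cos(phase)."""
    z = analytic_signal(x)
    envelope = [abs(v) for v in z]
    carrier = [math.cos(cmath.phase(v)) for v in z]
    return envelope, carrier
```

Multiplying the envelope by the carrier sample by sample gives back the original signal, which is why both parts have to reach the decoder in some form: the envelope is the part the speakers already know how to approximate, and the carrier is the open problem.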
They are applying P_L_P_ with some iteration with this this one approach. So they are doing some smoothing of a spectrogram in two ways. One was right. Right. But P_L_P_ is using just the spectral uh direction. Let's say you are doing uh for every twenty millisecond processing one one feature vector. Here you are processing one spectral band, frequency band, right. So it's different direction which we are w using. But anyway, this is very simple. So applying that uh absolute uh absolute operation, you get a Hilbert envelope. And of course then there is the phase. So it's just the complex operation. And that's more or less all what we are doing. So now what we know what we know is how to approximate that Hilbert envelope. We are ap we are applying that Hilbert uh him what is the Marius' uh stuff. So he knows how to encode uh the Hilbert uh envelope using like L_S_F_s or L_P_C_. You are applying again to some linear prediction model after that. But it's uh right, it's in a it's a little bit difference in not applied in the frequency domain. So instead of classical L_P_C_ domain, where you have a spectrum, let's say something like that, and you are applying some linear prediction model which is trying to approximate the spectrum, right. So this is frequency. In our case we are we are trying to approximate s kind of instantaneous energy. So if this is a time, and you get one and add one frequency band, you can see something like this. And we are trying to approximate that envelope which is that Hilbert envelope. Right? So again, we know what to more or less we know what to do with the Hilbert envelope. We know that we can apply linear prediction model and it works pretty well. But i if you want to really construct the speech signal or the you need also the phase signal, which is in uh some sense quite simple for voiced speech signal, say. But s uh the phase no, you ac It's a phase or you can encode it like a carrier. 
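The analytic-signal step described above (apply the Hilbert transform, take the amplitude to get the Hilbert envelope, keep the phase as the carrier) can be illustrated with a small standard-library sketch. This is only a toy demonstration on a synthetic amplitude-modulated cosine, not the coder discussed in the meeting; the function names `dft`, `analytic_signal` and `envelope_and_carrier` are made up for illustration, and the naive O(N^2) transform is an assumption chosen to keep the example self-contained.

```python
import cmath
import math

def dft(x, inverse=False):
    """Naive O(N^2) discrete Fourier transform (adequate for a short demo)."""
    n = len(x)
    sign = 1 if inverse else -1
    out = []
    for k in range(n):
        acc = sum(x[t] * cmath.exp(sign * 2j * math.pi * k * t / n) for t in range(n))
        out.append(acc / n if inverse else acc)
    return out

def analytic_signal(x):
    """Build the complex analytic signal of a real sequence.

    DC (and Nyquist, for even lengths) are kept, positive frequencies are
    doubled and negative frequencies are set to zero.
    """
    n = len(x)
    spec = dft(x)
    for k in range(1, n):
        if n % 2 == 0 and k == n // 2:
            continue              # Nyquist bin is left unchanged
        if k < (n + 1) // 2:
            spec[k] *= 2          # positive frequencies
        else:
            spec[k] = 0           # negative frequencies
    return dft(spec, inverse=True)

def envelope_and_carrier(x):
    """Split a real signal into Hilbert envelope and unit-amplitude carrier."""
    z = analytic_signal(x)
    envelope = [abs(v) for v in z]
    carrier = [math.cos(cmath.phase(v)) for v in z]
    return envelope, carrier

# Hypothetical test signal: a slow modulator on a cosine carrier.
N = 64
modulator = [1.0 + 0.5 * math.cos(2 * math.pi * 2 * t / N) for t in range(N)]
signal = [modulator[t] * math.cos(2 * math.pi * 16 * t / N) for t in range(N)]
envelope, carrier = envelope_and_carrier(signal)
```

For this synthetic signal the envelope follows the slow modulator and multiplying envelope by carrier gives back the input, which is the decomposition the decoder would have to invert.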
So the carrier is, if you know, uh amplitude modulation, right? You may see something like there is one cosine which is uh very would have quite a f right. Yeah. It's called like the phase. Right, right. Exactly. So the this is the signal which you also have to somehow encode or transmit from the coder to the decoder in order to re-synthesize it, right. So What do you mean? Again. For this one? For the the linear prediction model? Well it's uh just a linear prediction. Classical way. I mean oh, okay, w well, if you want to see it in more details, I mean Oh, you mean uh yeah, bit bit rate you mean? Okay, yeah. Sure, sure. So whereas we are what we are now is, let's say, something like one point two kilo-bits per second for uh that Hilbert envelope. Is It's again a roughly estimated somehow. But we still know that we can go down and right. Exactly, yeah. I don we are using like fifteen bands. Right. But they are first they are inter oh well, integrated into fifteen bands using oh classical way like in M_F_C_C_s or in P_L_P_. You just apply some those uh triangular windows or something like that in order to to get a li really like just fifteen bands, right. And each band is then processed independently. So for one band we more or less have w like w one hundred bits per second. For Hilbert envelope. Right. Right, right. You yeah, you can just right. Right. Right. Yeah. Uh right, that's what we did. Yeah, sure. Sure, sure. Yeah. There are many things we would wouldn't at all. But yeah, exactly. But now what I r really trying to do is somehow approximate that uh carrier signal, to do something with the carrier. There are many simple things which you can do like uh quantization quantization, for example. Right. You mean something like uh f uh that CELP coding system where you're or wh wh wh what is no, what is exactly scalable system? What you mean by that? Oh yeah, yeah, sure. This was yeah. For dif yeah, right. Right, right. Yeah. 
But I think this appr approach can do that because you really can split it into several bands. And you can decide how many bands you want more or less. So Yeah, this is a Bark scale we are using. Yeah, or Mel Mel scale, so it's uh it's being uh more wider for higher frequencies and of course the s the spaces right. Yeah. But what I w my sense is that it is not so much important still. I mean it doesn't matter if you're als use Mel scale or Bark scale. But right, yeah. But that's what we are using, yeah. Uh actually we are using just Gaussian windows here instead of those triangulars because they are more uh no, it's better than for decoding it's better to play with them because they they are not zero anywhere, you know. They are they are approaching or in they are in zero. But actually they are not. So d you don't have to care about some zero values here and but more or less it doesn't matter, yeah. We are using just Gaus Gaussian windows. Okay, so l those are that the Hilbert envelope is um is approximated by linear prediction. So you have L_P_C_ coefficients which you can transcribe into L_ L_ L_S_F_s, which are right, exactly. Yeah. Right, exactly. So first uh the approach is quite sim simple. So at the beginning we apply D_C_T_ in order to get to frequency domain. Then you apply those uh triangular or Gaussian windows. So this is done for one second, eight thousand samples for eight kilo-hertz sampling frequency, right? So for such D_C_T_ trajectory you apply those uh Gaussian windows. And then i so you split it into frequency sub-bands. And then each one f frequency sub-band is more or less uh Right. Right. Exactly. You just apply L_ L_P_, so auto-correlation let's say, or you can do power spectrum, whatever. Yeah, so you for each one you get L_S_F_s here. Uh well, I'm using now like twenty L_S_F_s per one second, per one frequency band. Fifteen. No no. If I just keep all them so twenty L_S_F_s, each one is um is quantized by four or five bits. Of what? 
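The envelope bit-rate figures quoted in the discussion (fifteen bands, twenty L_S_F_s per second per band, four or five bits each) can be checked with a few lines of arithmetic. The function name `envelope_bit_rate` is hypothetical, and the sketch assumes plain scalar quantization with a fixed number of bits per L_S_F_, which is a simplification of whatever quantizer is actually used.

```python
def envelope_bit_rate(bands, lsfs_per_band_per_second, bits_per_lsf):
    """Bits per second needed to send the envelope parameters.

    Assumes every L_S_F_ is coded independently with the same number of bits.
    """
    return bands * lsfs_per_band_per_second * bits_per_lsf

# Figures taken from the discussion.
BANDS = 15
LSFS_PER_BAND = 20

parameters_per_second = BANDS * LSFS_PER_BAND           # parameters sent each second
rate_4_bits = envelope_bit_rate(BANDS, LSFS_PER_BAND, 4)
rate_5_bits = envelope_bit_rate(BANDS, LSFS_PER_BAND, 5)
per_band_5_bits = envelope_bit_rate(1, LSFS_PER_BAND, 5)

print(parameters_per_second, rate_4_bits, rate_5_bits, per_band_5_bits)
```

With four bits per L_S_F_ this gives the roughly one point two kilo-bits per second mentioned for the Hilbert envelope, and with five bits a single band costs about one hundred bits per second.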
Three hundred bits? Two. Yeah, yeah. Yeah. Yeah. Exactly. You mean this phase, the the Hilbert carrier? Or no, of course we did Yeah. Uh w well, there are many things you can do. We don't well the first thing what you can really do is just to replace it by Gaussian noise. So you can really use t like in uh L_P_C_ system. Right. Exactly. Or you can replace it by some cosine, one cosine generated in a cen centre frequency of that Gaussian window. Right. Yeah. It mi it mi yeah. Twelve hundred? Yeah. Yeah, it's possible, some of it. Yeah. Mm-hmm. R right. Well I'm just comparing with uh L_P_C_ ten standard, which is like classical L_P_C_. A little bit optimized. So there in the there is I think two point four kilo-bits per second, the bit rate for that. Yeah, definitely. This is no, uh I mean L_P_C_ ten is just L_P_C_. No no no. No. Just noise or just those impulses. CELP. Yeah there yeah, yeah. Yeah. Yeah, yeah. Four point eight or something. Right. Yeah, sure. Well more or less what we didn't do is analysis by synthesis, right. So that's what CELP is doing, right. So yeah, so this is what we do and uh that might help a lot, of course. But w yeah, exactly. That's that's the question. It will be. Yeah, yeah. Yeah, sure. For example, uh there what I also did was uh a simple peak picking algorithm. So if you take a look on that spectrum of that of that carrier signal, which is more or less like the cosine if you really take a look on that. But it is kind of frequency shifted cosine or frequency modulated cosine. Right, so it's th the frequency still to be changing. Or of course the amplitude is just not constant all time. It should be theoretically. But practically it's not. So what you can see for a one second signal, you m more or less you already might see some like one one uh impulse there or spectral line for. But usually there are many others which are s quite small, but they are quite important. 
If you don't transmit them, then the signal is quite robotic or something like that. So if you're really transmitting just one spectral line. But this is what you can do. And you can understand pretty well. The signal is audible. Or if you're adjust uh th find one one line, one speaker and dom dom dominant one in the spectrum of that c uh carrier Hil Hilbert carrier, and you just transmit this parameter. No, n It's even easier because what you have to do is just divide the you just multiply that Hilbert envelope by the table carrier. That that's all. Yeah, yeah. Yeah, exactly. So so you just multiply that uh one Hilbert carrier, which is transcribed by L_S_F_s by that this one I_F_F_T_ signal. So Oh f you mean the phase of that uh of that carrier? Of of course I try to play also with that, and uh I found that it doesn't matter so much which what is the phase because we are using just one second window, right. So of course there there might be uh some uh disturb uh disturbed uh when you concatenate those segments together. But then the phase might be important. But otherwise, I mean, I didn't find any anything important with the phase. Like great, keep the phase zero and it works. Or Right. Right right right. Yeah. But it's again a little bit different, because they are i they are doing this in a frequency domain. But this is in time domain, more or less. Right. The spectrogram, you mean. Yeah, yeah, yeah. Yeah. Right. Right. Exactly. What we can also do very simple is just to get the magnitude spectrogram using this approximation without the carrier, just Hilbert envelope. But you just uh get it in in this way, in in the way of Rob's, right. But you can also get the s short time Fourier transform uh and keep it right. Just classical uh frame by frame algorithm, twenty milliseconds. Uh but you keep just the phases instead of magnitudes. And you can put this together, right. And it works pretty well, and then the signal is not uh just uh whispered. 
Then you really hear that there is the voiceness in the signal. But uh of course the quality is uh is not so good still. But uh you can use it for like two, three kilo-bits per second and it works. Like yeah. Right. Right. Yeah, sure. Anyway, so um hmm m maybe it would be nice if you t if you tell us uh what you do with the speech coding 'cause I d I didn't see any of P_H_D_s of you, I think. So if you did something so maybe it would be interesting at least to Yeah. Yeah, sure. It's true. So you were when when did you do that? Oh okay. Uh-huh. I think so, yeah. You mean using that uh C_D_M_A_? Or Yeah, it's not G_S_M_. Right it's uh kind of s no C_D_M_A_ is uh another level, I think. Mm-hmm. I think there is a l lot of money in that like, right. Like speech recognition, of course it's very interesting, but people don't use it so much as s just coding, just telephoning, right, or just Right. Yeah. They s yeah, yeah. Right. Right. No, I I. Well what we were what we were thinking about is oh, C_D_M_A_, I mean there is the b Well, the problem is this in with this application, or it doesn't have to be problem but one constraint is that we are processing one second of the signal. You can go down of course. But then you are losing little bit of the bit rate, let's say, right. So but there are many applications where you really can use it because you don't care about algorithmic delay, one second algorithmic delay, like for I don't know if instead of S_M_S_ you can really have some channel where you just right. Um so probably. Right, right. Yeah. Right. Yep. Right. What if s Which one, s oh Well, but the sinusoidal coding is usually m when you s at least what I think or uh what I know is when you are doing sinusoidal coding it means you just take the speech signal at the beginning, right. And you are trying to find to uh yeah, to find the harmonics in a temporal domain, right, for that. Right. Oh. Yeah. Right. 
More or less I did it maybe different way, but kind of algorithm like okay, trying to find out the strongest or the the most dominant peaks in the spectrum, which is more or less the same. But th But you know, the the f the interesting Right. Right, and that that's it. And then you do multiples of right. Yeah. Yeah. Like it If it is unwrap you mean. Unwrap uh phase. Right. You l right, yeah. Yeah, yeah. I also play with that. And for example what I found i well, in this case uh we are not trying to encode the speech signal. We are trying to encode some carrier, which is much easier, I would say, because you can really im you can see that even. So what I was trying to encode at the phases, so I just found that uh two bits is enough for that and you really don't hear the difference. So if you keep all the spectral lines as they were there but you were just uh quantizing with two bits or three bits for amplitude, two bits for for phase. So it's okay. The problem is that you have to transmit many of them. 'Cause there is not just one dominant component but usually more others which are quite small, right. Or what you have to do then you have to decrease the the the size of not one second, but you have to go down because then you can get just voiced part and unvoiced part. You know, one second it's usually mixed right somehow together. Not now, no. I wi Uh uh we don't no want to do that, yeah, because it's quite tricky. Right. yeah, sure. Exactly. Right. But for example what we also found is that if you use just the Hilbert envelope, that that one which is trying to approximate that energy in some low band, like ar around two hundred t uh hertz, and you take a look on the signal, on the trajectory, you can see that it works like kind of um voice detector. Because uh for a low energies there is usually noise. For high energies there is kind of a harmonicity. So it works pretty well for that. So we might use this one information for that. But Yeah, I think so, yeah. 
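The remark that two bits per phase value were enough can be made concrete with a uniform quantizer on the circle. The source does not say which quantizer was used, so the uniform design and the names `quantize_phase`, `dequantize_phase` and `circular_error` are assumptions made purely for illustration.

```python
import math

def quantize_phase(phase, bits=2):
    """Map a phase in radians to an integer index using 2**bits uniform levels."""
    levels = 1 << bits
    step = 2 * math.pi / levels
    return int(round(phase / step)) % levels

def dequantize_phase(index, bits=2):
    """Return the reconstruction phase, wrapped into (-pi, pi]."""
    levels = 1 << bits
    step = 2 * math.pi / levels
    phase = index * step
    if phase > math.pi:
        phase -= 2 * math.pi
    return phase

def circular_error(a, b):
    """Smallest signed angular difference between two phases."""
    return (a - b + math.pi) % (2 * math.pi) - math.pi

# With 2 bits there are four reconstruction points (0, pi/2, pi, -pi/2),
# so the worst-case error is a quarter of pi.
test_phases = [-math.pi + 0.01 * i for i in range(629)]
worst = max(
    abs(circular_error(p, dequantize_phase(quantize_phase(p))))
    for p in test_phases
)
```

Each spectral line would then cost two bits for phase plus two or three bits for amplitude, which is why the number of lines that must be sent dominates the carrier bit rate.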
Mm-hmm. Right. Uh-huh. Mm-hmm. Mm-hmm. That's true, yeah. Yeah. Yeah, yeah. Yeah, that's that's a problem. Sure. Uh f That would be nice n not to have anything, yeah. Right. Yeah, yeah. My Mm-hmm. Right. Exactly. Well I don't know. Maybe if you we uh really want to improve the the quality, then we will need it, I don't know, still. N not now. Yeah, uh well, I'm gonna t have a time next next month or in three weeks so I'll try to play some examples there for you and What we are trying to reconstr How does it look like or sure. Yeah, more or less. It's kind of uh not frequency. Let's say amplitude modulation when you have ju just the carrier and that modulation signal, right. Yeah, we yeah, yeah, yeah, exactly. Yeah, yeah. It's in between, exactly. So it's not this and that. But still yeah. But anyway what also the other approach which is possible to use. The other is one to keep the phase like this one, like two spectrograms, one the phase and one the magnitude. And well I can again play it and it works on two kilo-bits per second. Quite what uh no, what uh he was doing he was using it for speech recognition, what I what I know, but for speech coding no no no, it was I don't know. For speech coding you mean? I don't know. I don't know. M maybe. Right. Yeah. Mm-hmm. Well for example this phase information right. Yeah, yeah. I think so. There are m more papers which are using this approach, like trying to encode the magnitude spectrum or spectrogram. Uh the phase or you just uh re can again the phase can be replaced by the noise, which is pretty good. So that's uh the benefit uh variable approach. So if you really don't have a band width, you just can replace it by noise and it works. Of course then it doesn't sound as the original because it's more whispered or whatever. But you can understand well. 
And for example uh if you mention this one approach with the that uh trajectory of the phase, of unwrap phase recre uh I also use because you can do it here when you get into uh this domain where you apply uh or you're trying to compute uh s power spectrum in in order t let's say that this kind of trajectory, it doesn't in fit the same frequency over time domain. But here you are trying to compute uh doing a to do linear prediction. So we are just doing F_F_T_, right. Then applying uh power in order to get um to s to power domain, and then I_F_F_T_, right. So you get the auto-correlation coefficients. So before you're applying uh power spectrum, you can again get the you have got a complex f freq frequency or complex spectrum, you can get the phase, right. And the phase look quite similar like this. So you might see it's is more less like a line. Right. Yeah. Right. Yeah, it's it is very different, yeah. Then you are doing in different domain. I think this is in frequency domain, this in time domain. Um it is quite it's quite sensitive to any quantization, what I found. Really it looks like the lines. So you just say okay, let's p let's put the line there. But it doesn't work. Re well, you can understand. But you can hear that there are many artifacts which right, exactly. Right, right. And the another problem is that uh it's not band width variable. So you cannot say that this part will be the just the noise, which you can do in here in this this other approach. So that's why I didn't play with that so m more because I found there are more uh disadvantages than the advantages, this approach. So that's why. You mean this one? Yeah, if you take just the original phase and the Right. Yeah, it's it's it's not a phase. Yeah, yeah, yeah. It's the phase no, you can again put the that envelope with this phase together because envelope is twe No no. You g no, you get the f Yeah, you get the complex complex spectrum with this. 
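The route to the auto-correlation coefficients mentioned above (F_F_T_, then power, then I_F_F_T_) can be sketched as follows. This is a generic illustration on an arbitrary real sequence, not the actual coder; the zero-padding is an assumption added here so that the circular result equals the ordinary linear auto-correlation, and the helper names are hypothetical.

```python
import cmath
import math

def dft(x, inverse=False):
    """Naive O(N^2) discrete Fourier transform (adequate for a short demo)."""
    n = len(x)
    sign = 1 if inverse else -1
    out = []
    for k in range(n):
        acc = sum(x[t] * cmath.exp(sign * 2j * math.pi * k * t / n) for t in range(n))
        out.append(acc / n if inverse else acc)
    return out

def autocorr_via_dft(x, max_lag):
    """Auto-correlation through the power spectrum (transform, power, inverse)."""
    n = len(x)
    padded = list(x) + [0.0] * n          # padding avoids circular wrap-around
    spectrum = dft(padded)                 # complex spectrum (phase still available here)
    power = [abs(v) ** 2 for v in spectrum]
    r = dft(power, inverse=True)
    return [r[k].real for k in range(max_lag + 1)]

def autocorr_direct(x, max_lag):
    """Reference implementation: plain sum of lagged products."""
    n = len(x)
    return [sum(x[t] * x[t + k] for t in range(n - k)) for k in range(max_lag + 1)]

# Arbitrary short test sequence (hypothetical data).
x = [math.sin(0.3 * t) + 0.1 * t for t in range(16)]
r_fast = autocorr_via_dft(x, max_lag=8)
r_slow = autocorr_direct(x, max_lag=8)
```

The resulting coefficients are what a linear prediction solver would take as input, while the phase of the intermediate complex spectrum is the quantity described above as looking roughly like a line.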
No, no, I started with the D_C_T_ No, after D_C_T_ you are trying this is so kind of again a real trajectory, right. And you're trying to first find that Hilbert envelope. So uh let's well so this is a speech. No, but here what I want to say is really that y it's not part of Hilbert envelope, it's part of the D_C_T_ signal. So it's not it's enough if you have just this phase and f and the Hilbert envelope, you can put it together. You don't need the carrier. I mean this is kind of a carrier signal but in different domain represented. Right, right. So I what I think is that we already will need to apply s kind of analysis by synthesis approach mm uh in order to get something. Right. What I No. You mean ye Right. Right, right. Yes, that's true. But I mean that's maybe even the advantage here because you it's not speech d based, which means we are not Right. So you can when I tried and comparing with L_P_C_ ten on a uh twice bigger or higher bit rate, this work works better, it seems to me. Just for unvoiced speech. O I'm not mentioning uh voiceless because that's unvoiced, which means whispered. Uh yeah, but uh when you hear and try to use some examples with the music for example Right, yeah. Mm-hmm. That's true, yeah. Uh but anyway, it seemed to me pretty good. I mean that there might be. that's true, yeah. Uh it should be uh thirteenth December I think. I just uh you won't be here. Uh okay. No, I just swapped it with uh Mike Perrot I think 'cause h he is not going to be here this time. And I was supposed to have it in uh in January. So whatever. Anyway. So But anyway, I I wanted to hear something maybe from you what you are doing f with speech coding because you were you mentioned that By But it was based on like you were using kind of recognizer there, right? Uh-huh. Right. Right. Yeah, it's I'm I mean yeah, it's different. But uh it's speech coding at least. I think so. Right. Yeah, sure. Band you mean? Right. With different I don't know. 
But maybe what we can already do is uh to decrease this uh this distance. We can do a Hilbert envelope approximation with one second. But then we can split that signal into, I don't know, ten ten uh segments with the one hundred milliseconds or even less, and we can apply some other technique for that, you know. Still Right. Exactly, exactly. Yeah. Well, th more less they don't care what's uh what will be the quality of the of the or the bit rate, they just want to get something new, you know. So if you get the v very uh low bit rate with a reasonable quality or really high bit rate, which is c again different, but with uh high quality, uh they would be happy with the both. So You know, on one side it's pretty good but on the other you don't know which way you should go 'cause there are many of them. And uh Right, right. Okay. Well I I will W uh do you want to mention your speech coding experiments? So you are saying that you're doing something i in India with uh So you know the sub mm like um more details? Oh yeah. Okay. Right, right. Right. I see. Yeah, but it's quite interesting with the even with th uh s current speech coding uh technologies. We are they are still trying to use CELP or those L_P_ based approaches. There is nothing new, more or less. And if I when I was uh listening the uh the talk of uh Milan Jelinek, uh he's the Czech guy, but he lives in uh in Canada. I don't know the name of the university. But they they have uh the patent I think for CELP even. Or Um I don't know how it is exactly but they got very very famous for this for that CELP. What they did exactly I don't know, if something different a little bit or not, or they just keep those patents or I don't know how it is exactly. Maybe Hynek will know much more. But uh when I heard those that presentation, they didn't do so much new. 
I mean uh it was last year and uh I think Hynek, he was asking wh so what's new that you are so famous with that and they n they said okay, we we know exactly how to do that. I mean you know, not to make any errors like in because everybody knows how to do that. But uh it's not so easy to implement it, right. So uh there is nothing so much new really like. Yeah. Right. Right. Exactly. Right. Right, ri right, exactly, exactly. But then they are s still using like error signal, which is going to be approximated by code book, right? And a linear prediction, which is use which is approximating just the spectrum of for for and everything that that's that's all. So still Right. Yeah, yeah. Right. Well, I hope that we still have a a lot of time for that because uh I think yeah, it's pretty interesting, yeah. For example when I was uh you know, harmonic voice modelling, that H_N_M_ system which is using used for synthesis more less. It's pretty interesting. I mean it it works it works well. Right. Right. Yeah, yeah, it is. You mean uh in sinusoidal uh modelling? Yeah. And but anyway, do you know exactly how it is done when you really have those M_F_C_C_ coefficients, which means magnitude spectrum, and then you are trying to reconstruct the speech? Without a phase. Because I I tried you know those th those papers. But s I'm little confused about it. I don't know how it works. But Which one? Yeah, that would be nice. Oh. You know, m but there are some algorithms which were quite uh Yeah, yeah, yeah. Yeah. Uh it seems to me, yeah, something like that. Like try You know what you can do um maybe. What you can do is uh take just the minimum phase of which you can get from a linear prediction, right? You know what I mean, minimum phase signal. And it sounds quite uh reasonable too. But of course they are li it's not original at all. So I don't know how they do that exactly. No, I've no idea. Right. True, right. For n for Aurora we didn't do anything with that. 
Yeah, but we didn't do any speech recon re reconstruction. Yeah. Yeah, yeah, sure. Right. Right. Right. Right. You mean with this phase uh sure. Yeah. Yeah, they do some kind of reconstruction of speech. Uh yeah. Right. That's true, yeah. Right. Yeah. Yeah. So Yep. Yep. Mm-hmm. Mm-hmm. Sure, yeah. Okay. So how about you? Yeah. Yeah. You mean like channel and source coding together? Or no. Well I can imagine kind of like J_P_E_G_ operations, which might be more less like it's source coding and then we also channel coding because we just smooth the or just I don't know. I don't know. Right. Right. Right. R right, right. I don't know. I don't know so much about s channel coding, like those Huffman coding stuff and, you know. I mean it's just standard. Yeah. They're just standard techniques. But Yeah, exactly. Yeah, yeah. I think so. H yeah. end. Okay. Thank you guys. Uh if you find some good paper which you might There was that fam fam waveform pro prototyping cor
